Analysis of Dynamic Load Balancing Strategies for Parallel Shared Nothing Database Systems
نویسندگان
چکیده
Parallel database systems have to support both inter-transaction as well as intra-transaction parallelism. Inter-transaction parallelism (multi-user mode) is required to achieve high throughput, in particular for OLTP transactions, and sufficient cost-effectiveness. Intra-transaction parallelism is a prerequisite for reducing the response time of complex and data-intensive transactions (queries). In order to achieve both goals dynamic strategies for load balancing and scheduling are necessary which take the current system state into account for allocating transactions and subqueries to processors and for determining the degree of intra-transaction parallelism. We study the load balancing problem for parallel join processing in Shared Nothing database systems. In these systems, join processing is typically based on a dynamic redistribution of relations to join processors thus making dynamic load balancing strategies feasible. In particular, we study the performance of dynamic load balancing strategies for determining the number of join processors and for selection of the join processors. In contrast to previous studies on parallel join processing, we present a multi-user performance analysis for both homogeneous and heterogeneous/mixed workloads as well as for different database allocations.
منابع مشابه
Dynamic Load Balancing in Parallel Database Systems
Parallel database systems have to support both inter-transaction as well as intra-transaction parallelism. Inter-transaction parallelism (multi-user mode) is required to achieve high throughput, in particular for OLTP transactions, and sufficient cost-effectiveness. Intratransaction parallelism is a prerequisite for reducing the response time of complex and dataintensive transactions (queries)....
متن کاملOn the Performance of Parallel Join Processing in Shared Nothing Database Systems
Parallel database systems aim at providing high throughput for OLTP transactions as well as short response times for complex and data-intensive queries. Shared nothing systems represent the major architecture for parallel database processing. While the performance of such systems has been extensively analyzed in the past, the corresponding studies have made a number of best-case assumptions. In...
متن کاملDynamic Load Balancing in Hierarchical Parallel Database Systems
We consider the execution of multi-join queries in a hierarchical parallel system, i.e., a shared-nothing system whose nodes are shared-memory multiprocessors. In this context, load balancing must be addressed at two levels, locally among the processors of each shared-memory node and globally among all nodes. In this paper, we propose a dynamic execution model that maximizes local load balancin...
متن کاملPerformance of Load Balancing Techniques for Join Operations in Shared-Noting Database Management Systems
We investigate various load balancing approaches for hash-based join techniques popular in multicomputer-based shared-nothing database systems. When the tuples are not uniformly distributed among the hash buckets, redistribution of these buckets among the processors is necessary to maintain good system performance. Two recent load balancing techniques rely on sampling and incremental balancing,...
متن کاملOnline Heat Balancing for Parallel Indexed Database On Shared Nothing System
Shared nothing systems offer a tremendous processing capacity. In such highly parallel systems the data are typically declustered across the system processing elements (PEs) to exploit the I/O bandwidth of the PEs. However, the access pattern is inherently dynamic, which in turn can lead to performance degradation as some PEs become hot spot (bottleneck). Therefore, reorganization for heat (loa...
متن کامل